Noise-tolerance feasibility for restricted-domain Information Retrieval systems
نویسندگان
چکیده
منابع مشابه
Noise-tolerance feasibility for restricted-domain Information Retrieval systems
Information Retrieval systems normally have to work with rather heterogeneous sources, such as Web sites or documents from Optical Character Recognition tools. The correct conversion of these sources into flat text files is not a trivial task since noise may easily be introduced as a result of spelling or typeset errors. Interestingly, this is not a great drawback when the size of the corpus is...
متن کاملNoise-tolerance feasibility for restricted-domain Information Retrieval systems
Information Retrieval systems normally have to work with rather heterogeneous sources, such as Web sites or documents from Optical Character Recognition tools. The correct conversion of these sources into flat text files is not a trivial task since noise may easily be introduced as a result of spelling or typeset errors. Interestingly, this is not a great drawback when the size of the corpus is...
متن کاملPublic Transport Ontology for Passenger Information Retrieval
Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...
متن کاملUsing Domain Ontologies for Efficient Information Retrieval
Being the conceptual models that capture domain knowledge, ontologies can be looked upon for aiding meaningful information retrieval. This paper is an effort to improve the relevancy of results in a search system for a domain by exploiting the domain knowledge captured in an OWL DL Ontology. We propose a system that fits the query terms in the ontology graph in an appropriate way and exploits t...
متن کاملInformation Retrieval Systems Adapted to the Biomedical Domain
The terminology used in Biomedicine shows lexical peculiarities that have required the elaboration of terminological resources and information retrieval systems with specific functionalities. The main characteristics are the high rates of synonymy and homonymy, due to phenomena such as the proliferation of polysemic acronyms and their interaction with common language. Information retrieval syst...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data & Knowledge Engineering
سال: 2013
ISSN: 0169-023X
DOI: 10.1016/j.datak.2013.02.002